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Abstract 

Consider a set of items and a set of m colors, where each item is associated to one 
color. Consider also n computational agents connected by a ring. Each agent holds 
a subset of the items and items of the same color can be held by different agents. We 
analyze the problem of distributively assigning colors to agents in such a way that 
(a) each color is assigned to one agent only and (b) the number of different colors 
assigned to each agent is minimum. Since any color assignment requires the items 
be distributed according to it ( e.g. all items of the same color are to be held by only 
one agent), we define the cost of a color assignment as the amount of items that 
need to be moved, given an initial allocation. We first show that any distributed 
algorithm for this problem requires a message complexity of fi(n • m) and then 
we exhibit an optimal message complexity algorithm for synchronous rings that in 
polynomial time determines a color assignment with cost at most three times the 
optimal. We also discuss solutions for the asynchronous setting. Finally, we show 
how to get a better cost solution at the expenses of either the message or the time 
complexity. 
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1 Introduction 


We consider the following problem. We are given a set of computational agents connected 
by a (physical or logical) ringjj], and a set of items, each associated to one color from a 
given set. Initially each agent holds a set of items and items with the same color may be 
held by different agents ( e.g. see Fig[Q(a)). We wish the agents to agree on an assignment 
of colors to agents in such a way that each color is assigned to one agent only and that the 
maximum over all agents of the number of different colors assigned to the same agent is 
minimum. We call this a balanced assignment Fig |T] (b) and Fig[I|(c) show two possible 
balanced assignments. Among all such assignments, we seek the one that minimizes the 
total number of items that agents have to collect from other agents in order to satisfy the 
constraints. For example, agent ao in Fig [ 0 (b) is assigned colors V and 4 k, and therefore 
needs just to collect four items colored V, since no other agent has items colored 4 k. 



Figure 1: Three agents: ao, ai, a 2 , and six colors: V, <0, ( v l , A, 4k, Jl». (a) is the initial allocation, 
while ( b ) and (c) are two possible balanced color assignments. Items above the line are those that 
the agent collects from the others. Therefore their total number is the cost of the assignment. 
The assignment in (6) costs (4 x V) + (2 x + 4 x A) + (1 x 4k + 6 x <0>) = 17 items, while the 
assignment in (c) costs (4 x A) + (2 x + 4 x 0) + (5 x V + 1 x &) = 16 items. 


The problem can be formalized as follows. Let A = {a 0 ,... a n _ 1} be a set of n agents 
connected by a ring and let C = {c 0 ,..., c m _ 1} be a set of m colors. Let Q h i > 0 be the 
number of items with color c 3 initially held by agent a*, for every j = 0 , ... ,m — 1 , and 
for every i = 0,..., n — 1. 

Definition 1 (Balanced Coloring). A Balanced Coloring is an assignment n : { 0 ,... ,m — 
1 } —> { 0 ,..., n — 1 } of the m colors to the n agents in such a way that: 

• for every color Cj, there is at least one agent ai such that 7r(j) = i; 

1 An importand example of logical architecture is given by the set of ring shaped nodes of a Distributed 
Hash Table. 
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• for every agent ai, |_^J < \{cj \ ir(j) = i}\ < [— ]; i.e., the number of color 
assigned to agents has to be balanced. In particular, \_—J colors are assigned to 
[(LfJ + l) n — m\ agents, and |_^J + 1 colors to the remaining (m — agents. 

Any Balanced Coloring then assigns almost the same number of colors to each agent, 
and when m is a multiple of n, then each agent is assigned exactly the same number of 
colors. 

Definition 2 (Distributed Balanced Color Assignment Problem). The Distributed Bal¬ 
anced Color Assignment Problem aims at distributively finding a Balanced Coloring of 
minimum cost, where the cost of a Balanced Coloring n : {0,..., m — 1} — > {0,..., n — 1} 
is defined as 


m— 1 n— 1 


Cost(n) = 


( 1 ) 


j =0 *=o, 


The cost of the optimal assignment will be denoted by CostoPT- The approximation 

Cost /\ 

, where Cost^ is the cost of the 


ratio of a sub-optinral algorithm A is the quantity Cost 
solution computed by A. 


OPT 


Motivations. The scenario defined above may arise in many practical situations in 
which a set of agents independently search a common space (distributed crawlers, sensor 
networks, peer-to-peer agents, etc) and then have to reorganize the retrieved data (items) 
according to a given classification (colors), see for example [131 [TD, El]. In these cases, 
determining a distributed balanced color assignment may guarantee specialization by 
category with maximal use of data stored in local memory or balanced computational 
load of agents minimizing the communication among agents. 

A similar scenario may also arise in computational economics [2]. The distributed bal¬ 
anced color assignment formalizes a combinatorial auction problem where agents are the 
bidders and colors represent auction objects. The number of items that an agent holds for 
each color can be interpreted as a measure of desire for certain objects (colors). Balancing 
the number of colors per agent and minimizing the cost guarantees the maximum bidders 
satisfaction. 


The model. We assume that the agents in A = {ao,..., a n _i} are connected by a 
ring: agent a* can communicate only with its two neighbors ap+i) mo d n (clockwise) and 
ap_i) mo d n (anti-clockwise). We assume that each agent knows n (the number of agents), 
and C (the set of colors). Each agent a* is able to compute Pi = max 0 <j< m -i Qj,i inde¬ 
pendently, i.e., the maximum number of items it stores having the same color, while 
p = maxQ<i< n -iPi is unknown to the agents. 

We will consider both synchronous and asynchronous rings, always specifying which 
case we are working with or if results hold for both models. 
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For synchronous and asynchronous rings, we measure message complexity in the stan¬ 
dard way (cf. m HB]), he., we assume that messages of bit length at most clogn, for 
some constant c (called basic messages ), can be transmitted at unit cost. One message 
can carry at most a constant number of agent ID’s. Non basic messages of length L are 
allowed, and we charge a cost d \L/\ogn\ for their transmission, for some constant d. 

For what concerns time complexity, in the synchronous case we assume that agents 
have access to a global clock and that the distributed computation proceeds in rounds. 
In each round any agent can check if it has received a message (sent in the previous 
round), make some local computation, and finally send a message. In the asynchronous 
case agents don’t have access to a global clock, but the distributed computation is event 
driven (“upon receiving message a, take action /3”). A message sent from one agent to 
another will arrive in a finite but unbounded amount of time. 

Throughout the paper we will use the generic term time unit to designate the time 
needed for a message to traverse a link both in the synchronous and asynchronous case: 
for the synchronous case a time unit (also called round or time slot) is the time elapsed 
between two consecutive ticks of the clock; for the asynchronous setting a time unit can 
be any bounded finite amount of time. Nevertheless, in both cases the time complexity 
can be simply measured as the number of time units needed to complete the algorithm’s 
execution. 

Outline of the results. The goal of this paper is to analyze the efficiency with which 
we can solve the Distributed Balanced Color Assignment problem. In Section [2] we discuss 
some related problems and show the equivalence with the so called weighted /3-assignment 
problem in a centralized setting [3]. We also show that a brute force approach that 
first gathers all information at one agent, then computes the solution locally and finally 
broadcasts it, has a high message complexity of 0(mn 2 log pj log n). Fortunately, we can 
do better than this. In Section[3]we give an Vt(mn ) lower bound on the message complexity 
to determine a feasible solution (suitable for both synchronous and asynchronous cases). 
In Section [3] we present an algorithm that finds a feasible solution to the problem whose 
message complexity is O {rrin log rri / log n) , which is then optimal when m is bounded by 
a polynomial in n. 

Interestingly enough, message complexity is never affected by the value p, while run¬ 
ning time is. We then show how to adapt the algorithm to work also in the asynchronous 
case at the expenses of a slight increase in message complexity; this time the messege cost 
depends also on p, but the asymptotic bound is affected only when p is very large (i.e., 
only if p ^ 0(m m )). In Section [5] we show that the proposed algorithm (both synchronous 
and asynchronous versions) computes a Balanced Coloring whose cost is only a factor 
of three off the optimal one, and we also show that the analysis of the approximation is 
tight. Finally, we show that we can find Balanced Colorings with a better approximation 
ratio at the expenses of the message and/or time complexity. 

A preliminary version of this work appeared in [6], In the previous version it was 
assumed that parameter p (the maximum number of items of a given color) was known 
to the computational agents. Since in practical situations it is difficult to have a good 
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estimate of such a global parameter, in this new version we removed this assumption. 
This required both new algorithmic ideas and technical efforts. The algorithm for the 
asynchronous communication model was also not contained in the preliminary version. 
Finally, we enriched the proof of the lower bound with new insights that could be useful 
for further generalizations to different network topologies. 


2 Related problems and centralized version 

In this section we relate the Distributed Balanced Color Assignment problem to known 
matching problems that have been well studied in centralized settings. We will first show 
that when m = n our problem is equivalent to a maximum weight perfect matching 
problem on complete bipartite graphs. On the other hand, when m > n, our problem 
reduces to the weighted ^-assignment problem. 

The class of /3-assignment problems has been introduced by Chang and Lee [3], in the 
context of the problems of assigning jobs to workers, in order to incorporate the problem 
of balancing the work load that is given to each worker. In the weighted ^-assignment 
problem one aims at minimizing the maximum number of jobs assigned to each worker. 

The interested reader can find useful references on these problems, their complexity, 
and related approximation issues in [Q 0] [131 EDI EZ|- 

We associate to agents and colors the complete bipartite graph on n + m vertices, 
which we denote by G — (C,A,C x A). We add weights to G as follows: the weight of 
the edge joining agent a* and color Cj is Q^.%- 

Case m = n. Given a graph (V,E), a perfect matching is a subset M of edges in E 
such that no two edges in M share a common vertex and each vertex of V is incident to 
some edge in M. When edges of the graph have an associated weight, then a maximum 
weight perfect matching is a perfect matching such that the sum of the weights of the 
edges in the matching is maximum. 

Lemma 3. When m = n, a maximum weight perfect matching on G is a minimum cost 
solution to the balanced color assignment problem. 

Proof. Given a perfect matching E C £ — C x A on G, for every (cj,a*) e E we assign 
color Cj to agent oq. As G is complete and E is a perfect matching on G, every color 
is assigned to one and only one agent and vice-versa. Moreover, the cost of any color 
assignment E can be written as Yl e e£\E w ( e )-> an d expression achieves its minimum 
when E is a maximum weight perfect matching. □ 

Finding matchings in graphs is one of the most deeply investigated problems in Com¬ 
puter Science and Operations Research (see ng for a comprehensive description of the 
different variants, theoretical properties, and corresponding algorithms). The best algo¬ 
rithm known to find a perfect matching in a bipartite graph is due to Hopcroft and Karp 
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[10]. and runs in O (J-EjyjVj ) time, where V and E denote the vertex and edge sets, 
respectively. The best known algorithm for finding a maximum weight perfect matching 
is the Hungarian method , due to Kuhn [13], which runs in time 0(n 3 ). 

Case m > n. The /3-assignment problem is defined on a bipartite graph G = ( S,T,E ) 
where ( S,T ) is the bipartition of the vertex set. A /3-assignment of S in G is a subset of 
the edges X C E such that, in the induced subgraph G' = (. S,T,X ), the degree of every 
vertex in S is exactly one. Let /3(A) be the maximum degree, in G', of vertices in T and let 
/3(G) be the minimum value of /3(X) among all possible /3-assignments X. The weighted 
/3-assignment problem consists of finding a /3-assignment X with /3(X) = /3(G) which 
maximizes the total weight of the edges in X. The following lemma is straightforward. 

Lemma 4. The balanced color assignment problem is a weighted /3-assignment of C in 
the complete bipartite graph G = (C, A,C x *4.), with /3(G) = \m/n\. 

The fastest known algorithm to solve the weighted /3-assignment problem is due to 
Chang and Ho [3] and runs in 0(max{|5'| 2 |T|, |S'||T| 2 }) time, which in our case gives the 
bound 0{m 2 n). 

While the maximum weighted perfect matching problem (and its variants) has been 
widely investigated in the distributed setting (see M), no distributed results are known 
for the weighted /3-assignment problem. 

A brute force approach. A brute force distributed solution to the problem can be 
obtained by asking all the agents to send their color information to one specific agent 
(a priori chosen or elected as the leader of the ring); such an agent will then solve the 
problem locally and send the solution back to all the other agents. The factor dominating 
the message complexity of the algorithm above is the information collecting stage. Indeed, 
each agent sends 0{m ) non-basic messages, each corresponding to 0(logp/logn) basic 
messages, through 0{n ) links, on the average. This results in a message complexity of 
0{mn 2 log pj log n). On the other hand, we might think of an algorithm in which each 
agent selects the correct number of colors basing its choice just on local information (e.g. 
its label). This requires no communication at all, but, even if we are able to prove that 
the agents agree correctly on a balanced coloring, we have no guarantee on how good the 
solution is. As we already said, we show that we can do better than this. 

3 Lower bound on message complexity 

In this section we prove a lower bound on the message complexity of the problem that 
applies to both synchronous and asynchronous rings. 

We prove the lower bound on a particular subset X of the instances of the problem. 
Let n be even and let m = (nt)/ 2, for some integer t. Since we are only interested in 
asymptotic bounds, for the sake of simplicity, we will also assume that m is a multiple of 
n, i.e. t/2 = m/n is an integer. 
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For any agent a;, let ay denote the agent at maximum distance from a* on the ring. 
In the following we say that a color is assigned to the pair (a*, a*/), for i — 0, ..., n/2 — 1, 
to mean that it is assigned to both agents of the pair. We also say that a set C of colors 
is assigned to agent a iff all the colors in C are assigned to a. 

Let {C 0 ,... ,C n / 2 — 1 } be a partition of the set of colors such that \Cj\ = t for all j = 
0,..., n/ 2—1. Set X consists of all instances of the Distributed Balanced Color Assignment 
Problem such that for any i = 0,..., n/2 — 1, the following two conditions hold: 

(a) for any color j E Ci = {ii,... ,it} both agents of pair (a*, a*/) hold at least one item 
of color j, i.e. Q 3 . t > 0, Q 3 .y > 0; 

(b) neither a* nor ay hold colors not in C, : . 

Lemma 5. Given an instance in X , any optimal solution assigns to (at, ay) only colors 
from set Ci, for i = 0 ,..., n /2 — 1 . 

Proof. Consider any solution to an instance from set X that assigns to the agent a* a color 
h 0 initially held by some pair (a*,, ay), with k ^ i. Since any optimal solution is perfectly 
balanced on input instances of X, there must be at least one color h\ initially stored in 
(ai,ay) that is assigned to some other agent, say a p . The same argument can in turn be 
applied to a p and so on until (since the number of colors/agents is finite) we fall back on 
Ofc. Formally, there exists 0 < k < n/2 — 1, k ^ i, such that ho E C k 7 ^ C*, and a sequence 
of indices k 0 , ki ,..., k j, with k 0 = k, k\ = i and /q +] = k, such that 

• color h 0 G Ffc 0 (= Ck) is assigned to agent ak 1 (= a*); 

• color hi E C kl (— Ci) is assigned to agent a k2 ] 


• color hi G Cfc, is assigned to agent a kl+1 (= «fc)- 

Let Cost 1 denote the cost of such a solution and let T be the contribution to the cost 
given by colors different from h 0 , hi ,..., hi- Then, recalling condition (b) of the definition 
of X, we have 


n— 1 n— 1 n— 1 

Costi = r + E Qho,w “1“ ^ ^ Qhi,w H - * ’ ’ E Qh t ,w 

w=0, w=0, w=0, 

w^k2 w^ki_ 

— r+ ( Qh 0 ,k 0 + Qh 0 ,k' 0 ) +-1* ( Qh u ki + Qhi,k[)- 

Consider now a solution that differs from the previous one only by the fact that every 
color in C w is assigned to agent a w for w — k 0 , k \..., ki- Namely, 


• h 0 E Ck 0 is assigned to ak 0 ; 

• hi E Ck 3 is assigned to a kl ; 
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• hi G Ck t is assigned to a*;,. 


This is clearly a perfectly balanced solution, since each agent “loses” and “gains” 
exactly one color with respect to the previous case. Letting Cost 2 be the cost of such a 
solution, we have 


Cost 2 


72 — 1 72—1 

F ^ ^ Qho,w ”t“ ’ * * “t - ^ ^ Qh[,w 

w= 0 , w=0, 

w^kQ w^ki 


r + Qho,k ' 0 + • • • + Qhi,k[- 


Hence, 


Costi — Cost 2 — Qh 0 ,k 0 + • • • + Qh t ,ki > 0, 

where the inequality follows from condition (1) of the definition of I. □ 

We now consider two specific instances in I that will be used in the following proofs. 
For each pair (a,;, a,;/), for i — 0,... ,n/ 2 — 1, and its initially allocated set of colors 
Ci = {ii,...,}, fix any u > 1 and partition set C* into subsets C and C", each of 
cardinality t/ 2. We define instance Zi G X for the pair (a,, a*/) in the following way: 

Zi : = u for each j G Cj 

Qj,i’ = Qj,i = u for each j e C' 

Q J t i = Qj : i + 1 = u + 1 for each j e C" 

Hence, by construction, instance X\ has the property that for any j G C", > Qj,i- 

Example 6. Consider a pair (a*, a*/) with a set of colors Ci = {1, 2, 3,4, 5, 6, 7, 8}. Let 
u — 2. If C = {2,4, 5,8} and C" = {1,3, 6, 7}, f/ien instance Z\ will be as follows: 


colors 

1 

2 

3 

4 

5 

6 

7 

8 

ff items for 

2 

2 

2 

2 

2 

2 

2 

2 

ff items for ay 

3 

2 

3 

2 

2 

3 

3 

2 


In the following lemma we will show that the only optimal solution to T\ is the one 
that assigns C to at and C" to ay. The above example gives an intuition of the formal 
proof. By Lemma [5j we know that only the items that need to be exchanged between a* 
and ay account for the cost of the optimal solution, and the latter is achieved by moving 
items with weight 2 (those highlighted in bold in the table), i.e., by assigning C to a« and 
C" to ay, for a total cost of 16. 

Lemma 7. The only optimal solution to instance T\ is the one that assigns C to ai and 
C" to ay. 













Proof. We first compute the cost of this solution: 


Cost ^ ^ Qj,i f ~~f~ ^ ^ Qj,i ^ ^ Qj,i ~h ^ ^ Qj,i ^ ^ Qj,i‘ 

j&C' j&C” jeC' j&C” j&Ci 

Consider any other partition of C* into two sets C and C". Consider another solution 
that assigns C to a, and C" to av and let us compute the cost of this new solution: 


— El Qj,i' + 

El Q?',* 



j'ec 7 

j&C 77 



= El Qj,i' 

+ E 

i' + 

E Qti 

j&ync 1 

jeWnC" 


jec 77 

= Q ' h ' 1 

+ El 

7 + 

El Qjj 

j&cJnC' 

jeiynC" 


j'gc 77 

= E 

c El 

Qj 


jeCi\(C'nC") 

jeCnC" 



V 

M 

Q C El 

Qj 

t i = Cost, 

jeCi\(C'nC") 

j&?nc n 




where the inequality follows by observing that 

• there is at least one jE.Cn C", otherwise the two partitions would coincide; 

• on instance T\ we have that for every j E C" , Q h ,/ > Qj. t . 

□ 

We now define the instance I 2 6l for the pair (a 2 , a*/) in the following way: 

Z 2 : Qjy = u for each j E Cj 

Qj,i = Qj,i' — u for eac h 3 £ C 

Qj,i = Qj,i' — 1 = u — 1 for each j E C" 

where C), C', C", and u are set as before. By construction, instance Z 2 has now the property 
that for any j E C", Qjy < Qj ti . 

Example 8. Consider again the pair (a*, a*/) on the same set of colors Ci and same 
partition C = {2,4, 5,8}, C" = {1, 3,6,7}, and same u = 2, exactly as in Example [3 
Instance To will be as follows (the cost of the optimal solution is equal to 12 and highlighted 
in bold): 


colors 

1 

2 

3 

4 

5 

6 

7 

8 

f) items for 

2 

2 

2 

2 

2 

2 

2 

2 

j) items for a,/ 

1 

2 

1 

2 

2 

1 

1 

2 
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Observe that, from at point of view, instances X\ and Z 2 are indistinguishable. Nev¬ 
ertheless, the optimal solution for instance Z 2 is to assign to a* the complement set of 
indices with respect to the optimal solution to instance X\. 

Analogously as the previous lemma we can prove the following result. 

Lemma 9. There is only one optimal solution for instance X 2 : assign colors in C to a t > 
and colors in C" to a*. 

Proof. The proof is very similar to that of Lemma [0 The cost of the solution defined in 
the statement is now: 

Cost = Qj.i + Qj,i' — /w Qj,i’ + yi Qj,t’ = y ] Qj# ■ 

jec jec" jec j&C" jeCi 

The cost of any other solution is calculated as in the proof of Lemma [7| with the exception 
that now instance Z 2 has the property that for any j e C", Qj$ ^ Qj,i- I—I 

The core of the lower bound’s proof lies in the simple observation that agent at is 
not able to distinguish between instance Z x and instance Z 2 without knowing also the 
quantities Qjy for colors j falling into partition C". 

Lemma 10. If agent at knows at most t /2 colors held by a^, it cannot compute its optimal 
assignment of colors. 

Proof. Construct a partition of C* in the following way: place index j in C 1 if a* has 
knowledge of Qj^ and in C" in the other case. If the cardinality of C is smaller than 
i/2, arbitrarily add indices to reach cardinality i/2. Agent a, cannot distinguish between 
instances X\ and Z 2 constructed according to this partition of Q and, hence, by lemmas 
[7] and [9] cannot decide whether it is better to keep colors whose indices are in C or in 
C". Finally, observe that in both instances indices in C are exactly in the same position 
in the ordering of the colors held by a*/, thus the knowledge of these positions does not 
help. □ 

Theorem 11. The message complexity of the distributed color assignment problem on 
ring is Vt(mn). 

Proof. Let A be any distributed algorithm for the problem running on instances in X. By 
the end of the execution of A, each agent has to determine its own assignment of colors. Fix 
any pair (a;, a*/) and consider the time at which agent a* decides its own final assignment 
of colors. Assume that at this time a* knows information about at most t /2 = m/n colors 
of agent a,/. By Lemma [TOj it cannot determine an assignment of colors for itself yielding 
the optimal solution. 

Therefore, for all n /2 pairs (a*, a*/), agent a* has to get information concerning at least 
m/n of the colors held by a*/. We use Shannon’s Entropy to compute the minimum number 
of bits B to be exchanged between any pair (a*, a*/) so that this amount of information is 
known by Oj. We have: 
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B = log 


m 


Using Stirling’s approximation and the inequality m > n, we get 


B ~ 


> 


m m ■ n 

— ■ log-;— 

n m + n 

m , n ^ 

— ■ log 

n 2 



As a basic message contains logn bits, any pair ( di,a *,) needs to exchange at least 
Vt{m/n) basic messages. Each such message must traverse n/2 links of the ring to get to 
one agent of the pair to the other. As we have n/2 pairs of agents, the lower bound on 
message complexity is given by 


Vt{m/n) 


n 

2 


— G kl{m • n) 


□ 


4 A distributed message-optimal algorithm 

In this section we first describe an algorithm that exhibits optimal message complexity 
on synchronous ring. We will then show how to adapt the algorithm to the case of an 
asynchronous ring. In the next section we will prove that the algorithm is guaranteed to 
compute an approximation of the color assignment that is within a factor three from the 
optimal solution (for both synchronous and asynchronous ring). 

4.1 Synchronous ring 

At a high level, the algorithm consists of three phases: in the first phase, the algorithm 
elects a leader a 0 among the set of agents. The second phase of the algorithm is devoted 
to estimate the parameter p = rnaxj max,, Qj.i, i.e. the maximum number of items of a 
given color held by agents. Finally, the last phase performs the assignment of colors to 
agents in such a way to be consistent with Definition [Q In the following we describe the 
three phases in detail. 

Algorithm Sync-Balance 

Phase 1. The first phase is dedicated to leader election that can be done in 0(n) 
time with a message complexity of 0(n\ogn) on a ring of n nodes, even when the nodes 
are not aware of the size n of the ring 113- 

Leader election has also been studied in arbitrary wired networks [7J. An 0{n 
polylog(n)) time deterministic algorithm is available even for ad hoc radio networks of 
unknown and arbitrary topology without a collision detection mechanism, even though 
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the size of the network must be known to the algorithm code (see [5] for the currently 
best result). 

Without loss of generality, in the following we will assume that agent ao is the leader 
and that ai, a 2 ,a n -i are the other agents visiting the ring clockwise. In the rest of 
this paper, we will refer to agent Oj_i mo d n (resp. a i+ i mo d „) as to the preceding (resp. 
following ) neighbor of a*. 

Phase 2. In this phase agents agree on an upper bound p' of p such that // < 2 p. 
Given any agent a* and an integer r > 0, we define: 

{ 1 if maxj Qj ti = 0 and r = 0; 

1 if T < maxj Q 3)l < 2 r+1 and r > 0; 

0 otherwise. 

This phase is organized in consecutive stages labeled 0,1,... At stage r = 0, the leader 
sets an integer variable A to zero, which will be updated at the end of each stage and 
used to determine when to end this phase. 

In stage r > 0, agent a*, for i — 0,l,...,n—1, waits for i time units from the beginning 
of the stage. At that time a message M might arrive from its preceding neighbor. If no 
message arrives, then it is assumed that M = 0. Agent a* computes M = M + B^r) and, 
at time unit i + 1, sends M to its following neighbor only if M > 0, otherwise it remains 
silent. 

After n time slots in stage r, if the leader receives a message M < n from the preceding 
neighbor, then it updates variable A = A + M, and, if A < n, proceeds to stage r + 1 
of Phase 2; otherwise it sends a message clockwise on the ring containing the index of 
the last stage £ performed in Phase 2. Each agent then computes p l = 2 £+1 , forwards the 
message clockwise, waits for n — i + 1 time units and then proceeds to Phase 3. 

Lemma 12. Phase 2 of Algorithm Sync-Balance computes an upper bound p' of p such 
that p' < 2 p within 0(n\ogp) time units and using 0(n 2 ) basic messages. 

Proof. We will say that agent a, speaks up in stage r when i?*(r) = 1. Throughout the 
execution of the algorithm, integer variable A records the number of agents that have 
spoken up so far. 

Any agent a* speaks up in one stage only. Indeed, given the color j' for which agent a* 
has the maximum number of items, then -Bj(r) = 1 only at stage r such that Qp.i falls in 
the (unique) interval [2 r ,2 r+1 ). Let a*, be the agent having the largest amount of items 
of the same color among all agents, i.e., such that = p, for some j* G [0 ,m — 1]. 

Then Bi*(r ) = 1 for stage r such that 2 r < p < 2 r+1 , i.e., agent a** speaks up when r = £. 
Observe that at the end of stage £ the leader sets A = n, as all n agents must have spoken 
up by that time. Therefore, considering also the last extra stage in which the agents are 
informed of the value of £, Phase 2 ends after £ + 2 stages, i.e. n(2 + logp) time units. 

For what concerns message complexity, in each stage, for r = 0,... ,logp, either no 
messages are sent, or a message traverses a portion of the ring. Observe that, as each agent 
speaks up only once during this phase, messages circulating on the ring must always be 
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originated by different agents. Hence, the number of stages in which a message circulates 
on the ring is at most n and there must be at least max{0,logp — n} silent stages. In 
conclusion, Phase 2 message complexity is bounded by 0(n 2 ). 

As for the ratio between the actual value of p and its approximation p' computed in 
Phase 2, by construction we have that 2 1 <p and 


p' = 2 £+1 = 2 • 2 £ < 2 p. 


□ 


Phase 3. As a preliminary step, each agent a* computes the number of colors it will 
assign to itself and stores it in a variable /Q. Namely, each agent a,, for i = 0,1, ..., n — 1 
computes g = (|_—J + l) n — m and then sets /Q as follows (recall Definition [[]): 

JC. = ! LfJ if * <05 
1 Lf J + 1 Otherwise. 

In the rest of this phase, the agents agree on a color assignment such that each agent a* has 
exactly /Q colors. Algorithms Q] and [2] report the pseudo-code of the protocol performed 
by a general agent a* in this phase and that is here described. 

Let p' be the upper bound on p computed in Phase 2. Phase 3 consists of logp' + 1 
stages. In each stage r, for r = 0,..., logp', the agents take into consideration only colors 
whose weights fall in interval I r = [l r ,u r ) defined as follows: 



2", +°°) > 

2 ^r, for 0 < r < log p' 

0 , 1 ) 


(3) 


Observe that in consecutive stages, agents consider weights in decreasing order, as 

t.l: y | | ^ Jj y t 


At the beginning of each stage r, all agents have complete knowledge of the set of 
colors C r -1 that have already been assigned to some agent in previous stages. At the 
beginning of this phase, C_i is the empty set, and after the last stage is performed, C\ ogp i 
must be the set of all colors. 

Stage r is, in general, composed of two steps; however, the second step might not 
be performed, depending on the outcome of the first one. In the first step, the agents 
determine if there is at least one agent with a weight falling in interval I r , by forwarding 
a message around the ring only if one of the agents is in this situation. If a message 
circulates on the ring in step one, then all agents proceed to step two in order to assign 
colors whose weight fall in interval I r and to update the set of assigned colors. Otherwise, 
step two is skipped. Now, if there are still colors to be assigned (i.e., if C r j- C), all agents 
proceed to stage (r + 1); otherwise, the algorithm ends. In more details: 

Step 1 . Agent a* (leader included) waits i time units (zero for the leader) from the 
beginning of the stage, and then acts according to the following protocol: 
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Case 1: If a* receives a message from its preceding neighbor containing the label k of 
some agent a*,, it simply forwards the same message to its following neighbor and 
waits for (n + k — i — 1) time units; 

otherwise 

Case 2: If a,; has a weight falling into interval J r , then it sends a message containing its 
label i to its following neighbor and waits for (n — 1) time units; 

otherwise 

Case 3: It does nothing and waits for n time units. 

If Case 1 or Case 2 occurred, then agent a; knows that step 2 is to be performed and 
that it is going to start after waiting the designed time units. 

Otherwise, if Case 3 occurred, after n units of time, agent a* might receive a message 
(containing label k) from its preceding neighbor, or not. If it does, then a* learns that 
Case 2 occurred at some agent a*, having label k > i and that step 2 is to be performed. 
Hence, it forwards the message to its following neighbor in order to inform all agents 
having labels in the interval [i + 1,..., k — 1], unless this interval is empty (meaning that 
a* was the last agent to be informed). Then, after waiting for another (k — i — 1) time 
units, agent a* proceeds to step 2. On the contrary, if got no message, it learns that 
Case 2 did not occur at any agent and hence, step 2 needs not be performed. After 
waiting for (n — i ) time units, a* can proceed to the next stage (r + 1). 

Observe that, when step 2 has to be performed, step 1 lasts exactly n + k — 1 < 2n 
time units for all agents, where k is the smallest agent’s label at which Case 2 occurs, 
while it lasts exactly 2 n time units for all agents in the opposite case. Indeed, referring 
to the pseudo-code in Algorithm [TJ completion time is given by the sum of the time units 
in the following code lines: in Case 1 of lines 7 and 10 {i ^ k)\ in Case 2 of lines 7 and 15 
{i — k)\ in Case 3 of lines 7, 18 and 22 if agents proceed to Step 2 (i ^ k), and lines 7 
and 26 otherwise. 

As the time needed by agents to agree on skipping STEP 2 is larger than the time 
needed to agree in performing it, it is not possible that some agent proceeds to STEP 2 
and some other to stage (r + 1). On the contrary, agents are perfectly synchronized to 
proceed to step 2 or stage (r + 1). 

Step 2. When this step is performed, there exists a non empty subset of agents having 
at least one weight falling into interval I r . Only these agents actively participate to the 
color assignment phase, while the others just forward messages and update their list of 
assigned colors. Color assignment is done using a greedy strategy: agent a* assigns itself 
the colors it holds which fall into interval I r and that have not been already assigned to 
other agents. Once a color is assigned to an agent, it will never be re-assigned to another 
one. 

To agree on the assignment, the agents proceed in the following way: agent a; creates 
the list Ci )T of colors it holds whose weights fall into interval I r and that have not been 
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assigned in previous stages. Then, a* waits i time units (zero for the leader) from the 
beginning of the step. At that time, either a* receives a message At from its preceding 
neighbor or not. In the first case, the message contains the set of colors assigned in 
this stage to agents closer to the leader (obviously, this case can never happen to the 
leader). Agent a* then checks if there are some colors in its list C i r that are not contained 
in Af (empty message in the case of the leader), and then assigns itself as many such 
colors as possible, without violating the constraint /C* on the maximum number of colors 
a single agent might be assigned. Then, at updates message M. by adding the colors it 
assigned itself, and finally sends the message to its following neighbor. If Ci )T is empty, or 
it contains only already assigned colors, a* just forwards message A4 as it was. In both 
cases, a* then waits for a new message AT that will contain the complete list of colors 
assigned in this stage. AT is used by all a* to update the list of already assigned colors 
and is forwarded on the ring. When the message is back to the leader, stage (r + 1) can 
start. 

Lemma 13. Let K r be the number of colors assigned in stage r of Phase 3, then stage r 
can be completed in at most 0{n ) time units using at most O (^n ■ — r ^° s n m j basic messages. 

Proof. The bound on the time complexity follows straightforwardly by observing that 
each of the two steps requires at most 2 n time units. 

For what concerns message complexity, Step 1 requires no messages if Step 2 is 
skipped, and n — 1 otherwise. In fact, only one basic message goes clockwise on the ring 
from ak to ak~ i, where k is the smallest index at which Case 2 occurs. The worst case for 
Step 2 is the case in which the leader itself assigns some colors, as a possibly long message 
containing color ID’s must go twice around the ring. As there are m colors, one color 
can be codified using logm bits, then, sending K r colors requires no more than A j o 1 °^” t 
basic messages. In conclusion, the total number of basic messages is upper bounded by 

V □ 

\ logn J 

Corollary 14. Phase 3 of Algorithm Sync-Balance can be completed within O(nlogp) 
time units and using 0{nm ■ y^) basic messages. 

Proof. It will suffice to sum up the worst cases for message and time complexity from 
Lemma fl3l over all stages r = 0,..., logy/, where p' < 2 p (Lemma fl2jl . 

The upper bound on the time complexity is straightforward. Let K r be defined as 
in the statement of Lemma [131 i.e. as the number of colors assigned in a generic stage 
r of Phase 3. The upper bound on the message complexity follows by observing that 
Y^=o K r = m, as the total number of assigned colors during the logy/ + 1 stages is 
exactly the given number of colors. 

□ 

We are now ready to prove that our algorithm is correct. In Section [5] we will evaluate 
the ratio of the cost of the solution found by this algorithm and the one of the optimal 
solution. 
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Algorithm 1 Sync-Balance - Phase 3 (performed by agent a,) 


Require: p' computed in Phase 2 > upper bound to maximum number of items of the same 

color 

1: Compute /C; t> Number of colors cp has to be assigned, as defined in Equation ([2]) 

2: C -1 -<—0 t- set of colors assigned up to the previous stage 

3: for r = 0 to logy/ do 

4: Ci tr = {cj | Cj C r - 1 and Qjj G I r } t> Colors assignable to Oj in stage r. Intervals I r are 

defined in © 

> Begin of Step 1 


5: Wait i time units 

6: if Got message At = {k} from its preceding neighbor then 


> Case 1 
> k < i is an agent label 


7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 


> proceeds to Step 2 


> Case 2 


> proceeds to Step 2 
> Case 3 


Forward message At to its following neighbor 
Wait n — i + k — 1 time units 
Step 2 
else 

if 7 ^ 0 then 

Send message At = {z} to its following neighbor 
Wait n — 1 time units 
Step 2 
else 

Wait n time units 

if Got message At = {A"} from its preceding neighbor then 

if k — i — 1 > 0 then > informs other agents that Step 2 is to be performed 
Forward message At to its following neighbor 
Wait for k — i — 1 

Step 2 > procedure call to Step 2 

end if 
else 

Wait n — i time units t> proceeds to next stage skipping Step 2 

end if 
end if 
end if 
end for 
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Algorithm 2 Sync-Balance - Phase 3 Step 2 (performed by agent a*) 


1: procedure Step 2 
2 : Wait i time units 

3: if Got message At from preceding neighbor with list of colors then 

4: £i tr <— Ci^r \ At > list of candidate colors to self assign 

5: else 

6 : Create empty message At 

7 : end if 

8 : if \£i >r \ 0 then 

9: Self assign maximum number of colors among those in C, % r 

> the total number of colors a* can assign itself is given by /Q 
10 : Add self assigned colors to At 

11: end if 

12: if At 7^ 0 then 

13: Send message At to the following neighbor 

14: end if 

15: Wait for message At' from preceding neighbor with list of colors 

16: C r <— C r -1 U At' > updates set of assigned colors 

17: Forward message At' to the following neighbor 

18: if C r = C then > all colors have been assigned 

19: stop 

20: else 

21: Wait for n — i time units 

22: end if 

23: end procedure 
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Theorem 15. Assuming m e 0(n c ), for some constant c, Algorithm Sync-Balance 
finds a feasible solution to the balanced color assignment problem in time 0(n logp) using 
0 (mn) messages. 

Proof. To prove correctness, we show that any assignment of colors to agents computed 
by algorithm Sync-Balance satisfies the two following conditions: 

(i) A color Cj cannot be assigned to more than one agent. 

(ii) All colors are assigned. 

(*) The algorithm can assign a new color Cj to agent a* only in line 9 of Algorithm [2j This 
can only happen if c 3 has not been already assigned in a previous stage, or in the current 
stage to an agent with smaller label. Since, in the stage, the color assignment is done 
sequentially (starting from the leader and following the ring clockwise), no color can be 
assigned to two different agents. Moreover, in lines 15-17 of Algorithm^ all agents update 
the list of colors assigned in the current stage and, hence, in later stages, already assigned 
colors will not be assigned again. Therefore Sync-Balance prevents the assignment of 
the same color to two different agents. 

(ii) If an available color c 3 of weight Q J3 e [l r ,u r ) is not taken by a* during stage r, it 
is only because a* has enough colors already (line 9). However, this circumstance may 
not occur at all agents during the same stage (for this would imply that there were more 
than m colors). Thus, either the color is taken by a higher labeled agent in stage r, or is 
“left free” for agents for which the weight of c 3 is less than l r . By iterating the reasoning, 
we may conclude that, if not taken before, the color must be eventually assigned in stage 
[logp] + 1, where agents are allowed to pick colors for which their weight is zero. 

As for upper bounds on time and message complexities, by summing up upper bounds 
for the three phases, we have 


Time complexity: 


0(n) + 0(n logp) + 0(n logp) = 0(n logp), 

Phase 1 Phase 2 Phase 3 


Message complexity: 0(n logn) + 0(n 2 ) + O ( nm ■ 


Phase 1 Phase 2 


logm 

logn 


= 0(nm), 


-v- 

Phase 3 


where we used Lemma [12l Corollary [T4J and the facts that m > n and that 
logm/logn G 0(1), under the given hypothesis. 

□ 


4.2 Asynchronous ring 

In an asynchronous ring such instructions as ’’wait for i time units” (see Algorithm Q] 
and [2)) cannot guarantee a correct completion of the global algorithm. Here we discuss 
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how to make simple modifications to Sync-Balance in order to get an algorithm (named 
Async-Balance) that correctly works in the asynchronous case as well. 

The leader election in Phase 1 can be done in O(n) time with a message complexity of 
O(nlogn) even on an asynchronous ring of n nodes [IT] . Therefore, the main differences 
are in Phase 2 and Phase 3. 

In Phase 2 we propose a slightly different strategy that works in only 2 stages, in¬ 
stead of logp. This better time complexity translates, in general, into an extra cost in 
terms of message complexity. Nevertheless, under reasonable hypothesis (namely when 
p E 0(m m )), the message complexity reduces to the same bound as for the synchronous 
setting. 

Finally, in Phase 3, the main ideas remain the same, but there are no “silent stages” 
and the leader acts differently from the other agents, as it is the one originating all 
messages circulating on the ring. 

In the following we highlight the main differences with the synchronous protocol: 

Algorithm Async-Balance 

Phase 1. Leader election can be accomplished with an O(nlogn) message complex¬ 
ity di]. 

Phase 2. This phase consists of only two stages. In the first stage the agents compute 
p = max,; rnaXj Q hl . Let p, = rnaxj Qj.i, i.e. the maximum number of items of the same 
color agent a* posseses. The leader originates a message containing p 0 . Upon reception 
of a message M from its preceding neighbor, agent a, computes M = max{M, p,} and 
forwards M to its following neighbor. The message that gets back to the leader contains 
p and it is forwarded once again on the ring to inform all agents. 

Observe that Phase 2 requires no more than O [n ■ basic messages, as 

0 (logp/ logn) basic messages are needed to send the p*’s and p. 

Phase 3. Changes in this phase concern both the execution of step 1 and step 2, that 
are to be modified in the following way: 

Step 1. Each agent a, computes its list of assignable colors C^ r and sets Y t (r) = 1 if 
\Ci, r \ > 0, and Yi{r) = 0 otherwise. The leader starts the step by sending, to its following 
neighbor, a basic boolean message containing Yo(r). Upon reception of a message M 
from its preceding neighbor, agent a, computes M = M V b(r) and forwards M to its 
following neighbor. When the leader gets the message back, it forwards the message again 
on the ring, and the same is done by all agents, until the message arrives to a n _i. The 
second time one agent (leader included) gets the message, it checks its content: if it is a 
one, then it knows that it has to proceed to Step 2; otherwise, if it contains a zero, it 
proceeds to the next stage. 

Step 2. The leader starts the step by sending, to its following neighbor, a (possibly 
empty) list of self assigned colors, obtained exactly as in the synchronous case. Then 
agents act as in the synchronous protocol, with the exception that they are activated by 
the arrival of a message from the preceding neighbor and not by a time stamp. Agents 
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proceed to the next stage after forwarding the complete list of colors assigned in the 
stage. 

Lemma 16. Let K r be the number of colors assigned in stage r of Phase 3, then stage r 
can be completed using at most O (Vi • basic messages. 

Proof, step 1 is always performed and a basic message is forwarded (almost^) twice 
around the ring. Hence, O(n) basic messages are used. When step 2 is performed, a 
message containing color ID’s goes (almost) twice around the ring. Analogously to the 

synchronous case, we can prove that no more than O ^n • A [ 0 VV''' j messages are needed. 

□ 

Analogously to the synchronous case, we can prove the following corollary. 

Corollary 17. Phase 3 of Algorithm Async-Balance can be completed using 
O (nm ■ basic messages. 

Theorem 18. Assuming m e 0{n c ), for some constant c, Algorithm Async-Balance 
finds a feasible solution to the balanced color assignment problem, on asynchronous rings, 
within time 0{n\ogp ) using O (n ■ + nrn^j basic messages. 

Proof. The correctness proof is analogous to the synchronous case. 

The time complexity is asymptotically equivalent to the synchronous case. Indeed, as 
already mentioned, the leader election in Phase 1 can be completed in O(n) time, Phase 
2 requires 2 circles around the ring and, finally, Phase 3 includes O(logp) stages, each of 
them requiring 2 circles around the ring. 

For what concerns message complexity, summing up upper bounds for single phases, 
we get 


Ofnlogn ) + O I n ■ ^ ) _|_ O 

' log n J 


nm ■ 


logm 

logn 


= O 


log p 

n ■ - -B nm 

log n 


as m E 0{n c 


□ 


When we also have that O(logp) = 0(m logm), the algorithm exhibits the same 
optimal message complexity as in the synchronous setting. Namely, we can state the 
following result. 

Corollary 19. If m 6 0(n c ), for some constant c, and p 6 0{m m ), then Algorithm 
Async-Balance finds a feasible solution to the balanced color assignment problem, on 
asynchronous rings, using Q(nm ) messages. 

2 On the second stage, agent a n -1 stops the message. 
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5 Approximation Factor of Algorithm Balance 

The main result of this section is that the cost of the solution (as defined in Definition [2]) 
computed by the algorithms presented in the previous sections is at most a small constant 
factor larger than the cost of the optimal solution. Namely, we will show that it is at 
most three times the optimal solution and that the analysis is tight. Moreover, we will 
show how to modify the algorithm to get a 2-approximation ratio at the expenses of a 
little increase of message complexity, and, for the synchronous case only, how to get a 
(2 + e)-approximation ratio (for every 0 < e < 1) at the expenses of an increase in time 
complexity. 

Since, under the same assumptions of Corollary [19l the cost of the solution is the 
same both in the synchronous and asynchronous versions (the assignment of colors is 
exactly the same in both cases), in this section we will address both Sync-Balance and 
Async-Balance with the generic name Balance. In the following some results are ex¬ 
pressed in terms of the value p' (respectively, p) computed by the agents in the syn¬ 
chronous (resp. asynchronous) case during Phase 2 of the algorithm. As these results 
hold for both p' and p , to avoid repeating the distinction between p ' and p over and over 
again, we will indicate with p both values p' and p. 

We begin with the following lemma: 

Lemma 20. Let color Cj be assigned to agent a* in stage r (of Phase 3) by algorithm 
Balance. Let a *. be a different agent such that Qj^ G [l r ,u r ). Then Q, J:l < 2 • Qj 

Proof. If r = |~logp] + 1 (i.e., is the last stage), then it must be Qj }i = Qj y k = 0, and we 
are done. Otherwise, as Cj is assigned to agent a* in stage r then it must be Qjj G [Z r ,w r ) 
and the thesis easily follows from 


P 

2 r+1 


— Qj,ii Qj,k < '- 


P_ 
2 r " 


□ 


Let B : {1,..., m} —> {1,..., n} be the assignment of colors to agents determined by 
algorithm Balance, and let OPT : {l,...,m} —>■ {1, ...,n} be an optimal assignment. 
Define a partition of the set of colors based on their indices, as follows: 


• C = {j | B (j ) = opt(j)}; i.e., color indices for which the assignment made by 
algorithm Balance coincides with (that of) the optimal solution. 

• C" = { 0 ,..., m — 1} \ C\ i.e., colors indices for which the assignment made by 
algorithm Balance is different from the one of the optimal solution. 


Lemma 21. Assume C" is not empty (for otherwise the assignment computed by Balance 
would be optimal) and let j G C". Let k j be any other color index in C" such that 
B(k) = OPt(j). Then 

Qj,OPT(j) — max{2 • Qji Qk■ 
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Proof. First observe that, as j,k G C" and B(k) = OPt(j') ^ B(j), we have that B(j) ^ 

B(k). 

If Qj,OPT(j) < Qk,B(k ) we are clearly done. Suppose now that QyoPT(i) > Qkp{k), then 
we can prove that QyoPT(j) < 2 • Q 3 M(j) ■ 

The fact that j E C" means that Balance assigned color Cj to a different agent com¬ 
pared to the assignment of the optimal solution. Let r be the stage of Balance execution 
in which agent OPt(j) processed color c 3 (i.e., Qj : oPT(j) G h) and could not self assign 
Cj, then (in principle) one of the following conditions was true at stage r: 

1. opt(j) already reached its maximum number of colors before stage r. 

However, this is impossible. It is in fact a contradiction that OPT(j) gets color c*, 
(recall that OPt(j') = B [k)) but does not get color Cj under Balance, since we are 
assuming (2j,OPT(j) > Qfc.OPTd) > which means that the assignment of c*, cannot be 
done earlier than Cj’s assignment. 

2. Color cj has already been assigned to B (j). This might happen because 

(a) Cj has been assigned to B(j) in a previous stage. 

This implies that B (j ) has a larger number of items of color Cj with respect to 
OPt(j), he., that QjjOPT^) — Qj,s(j) — 2 ■ 

(b) Cj has been assigned to B (j) in the same stage, because it has a smaller label 
on the ring. 

By Lemma 1201 we then have that Qj t oPT(j) < 2 • Qj,B(j)- 


□ 

Theorem 22. Balance is a 3-approximation algorithm for the Distributed Balanced Color 
Assignment Problem. 


Proof. Let Cost B and CosfoPT be the cos t of the solutions given by algorithm Balance 
and OPT, respectively. We can express these costs in the following way (where, for 
simplicity, we omit index i' s range, that is always [0,n — 1]): 


Cost B 


m—1 


v v Qjt 

j=0 


y y qd + y, y, q ?,* 

jeC'i^B(j) j£C"i^B(j) 


y y Qj,i+y 

jec i^B(j) jeC" 


/ 

Q 3 .OPT(j) + 

V 


\ 

yi Qj,i 

i^OPTo), . 
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Analogously, 


( \ 

Cost OPT = ^ ] y ^ Qj,i + ^ ] QjMj) + y ' Qj,i 

j&C i^OPT(j) jeC" i^OPTfj), , 

\ / 

By definition, B(j) = OPt(j'), for j G C , and thus ]T ^ Q jti = J2 J2 Qj,u *- e -> 

jgC' ieC' ^OPT(j) 

the cost associated with color Cj G C is exactly the same for Balance and OPT. Notice 

also that the term J2 Qj,i appears in both cost expressions. Hence, to prove 

jec" i^OPTo) 

i^ B U) 

that Cost B < 3 • CostoPT, it is sufficient to show that 

Qi,OPTU) < 3 QifiU)- ( 4 ) 

jec" jec" 

We can assume without loss of generality that m is a multiple of n. Indeed, if otherwise 
n does not divide m, we can add r dummy colors (for r = m — \_m/n\), i.e. such that 
Q h% = 0 for all agents i and dummy color j. Since in our algorithm the agents consider the 
weights in decreasing order, the dummy colors will be processed at the end and therefore 
they have no effect on the assignment of the other colors. Moreover, as their weights are 
zero, they do not cause any change in the cost of the solution. 

To prove (j4j), we build a partition of the set C" according to the following procedure. 
We start from any j\ in C" and find another index j 2 such that B(j 2 ) = OPT(j') for some 
j' G C" \ {j 2 }- Note that, since m is a multiple of n, every agent must have m/n colors and 
therefore such an index j 2 must exist. If j' = j\ the procedure ends, otherwise we have 
found another index j 3 = j' such that B(j 3 ) = OPt(j"). Again, if j" = ji the procedure 
ends, otherwise we repeat until, for some t > 2, we eventually get B (j t ) = OPT^) . We 
then set 


Ci = {(jT, h), U2J3), • • •, Ut-uJt)} ■ 

If during this procedure we considered all indices in C" we stop, otherwise, we pick 
another index not appearing in C\ and repeat the same procedure to define a second set 
C 2 , and so on until each index of C" appears in one C*. Observe that each C* contains 
at least two pairs of indices and that each index j G C" appears in exactly two pairs of 
exactly one C*. 

Then, using Lemma [Til we get 


23 




jeC" 


^2 Qj, optu) = 22 22 Qj,OPT(j) 

i&C" Ci (ij')GCi 


^ 22 22 max { 2 • Qj,B(j), Qj'Xj')} 

Ci (jj')eCi 


— 22 22 Qj'p(j')) 

Ci (ij')GCi 



□ 


The following theorem shows that the approximation factor given in Theorem [5] is 
tight. 

Theorem 23. For any 0 < e < 1, there exist instances of the Balanced Color Assignment 
Problem such that COSTb is a factor 3 — 4e/(4 5 + e) larger than the optimal cost , for 
some 0 < 5 < 1. 

Proof. Consider the following instance of the balanced color assignment problem. For the 
sake of presentation, we assume that m = n and that n is even, but it is straightforward 
to extend the proof to the general case. 

Fix any rational e > 0, and let q, 5 > 0 be such that qe/ 4 is an integer and q5 = [q\. 
Consider an instance of the problem such that colors are distributed as follows: 



n 

i = 0 , 1 ,..., - - 1 


and that a o is the leader elected in the first stage of algorithm Balance, and that the 
labels assigned to agents ai,..., a n _i are 1,..., n — 1, respectively. 

Consider agents a 2 i and rj 2 *+i, for any 0 < i < | — 1. We can always assume that q is 
such that 


< qS < q{6 + e/4) < q(26 - e/4) < 


for some r. That is, the weights of color c 2 i for agents a 2 j and a 2 i+i belong to the same 
interval \p/2 r+1 ,p/2 r ). 

It is easy to see that the optimal assignment gives c 2i+ 1 to a 2i and c 2i to a 2i+ The 
corresponding cost is CostoPT = f<?(5 + e/4). On the other hand, algorithm Balance 
assigns c 2i to a 2i and c 2i+ 1 to a 2i+ \, with a corresponding cost Cost B = ^q(35 — e/4). 
Hence, for the approximation factor, we get 
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Cost B _ 35 — f _ 3 (5 + |) T + I_o 4e 
CostoPT 5+1 5 + ^ 5 + | 45 + e 


□ 


Even if the approximability result is tight, if we are willing to pay something in message 
complexity, we can get a 2-approximation algorithm. 

Corollary 24. Algorithm Balance can be transformed into a 2-approximation algorithm, 
by paying an additional multiplicative O(logp) factor in message complexity. 

Proof. Algorithm Balance is modified in the following way: colors in stage r of STEP 2 
in Phase 3 are assigned to the agent having the largest number of items (falling in the 
interval I r ) and not to the one close to the leader. This can be achieved by making the 
agent forward on the ring, not only their choice of colors, but also their QijS for those 
colors. This requires extra O(logp) bits per color, increasing total message complexity of 
such a multiplicative factor. 

For what concerns the approximation factor, this modification to the algorithm allows 
to restate the thesis of Lemma [201 without the 2 multiplicative factor and, following the 
same reasoning of Theorem [5j conclude the proof. 


□ 


Finally, if we are not willing to pay extra message complexity, but we are allowed to 
wait for a longer time, we get a (2 + e)-approximation algorithm. 

Theorem 25. Assuming m G 0(n c ), for some constant c, for any 0 < e < 1 ? there is a 
(2 + e)-approximation algorithm for the Distributed Balanced Color Assignment Problem 
with running time 0(n\og l+e p) and message complexity Vt{nm). 

Proof. Modify the two interval threshold values of algorithm Sync-Balance in the follow¬ 
ing way: 



and redefine 



(5) 


Accordingly, the statement of Lemma [20] becomes Qj. t < (1 + e)Qj,k, and the statement 
of Lemma [21] can be rewritten as 


Qj,OPT(j) < rnax{(l + e) • Qj^j), Q 


The result on the approximation factor then follows by the same arguments of the 
proof of Theorem [5] The message complexity is not affected by these changes, while the 
running time now depends on the number of stages in Phase 3, that is 0(log 1+e p). □ 
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6 Conclusion 


In this paper we have considered the Distributed Balanced Color Assignment problem, 
which we showed to be the distributed version of different matching problems. In the 
distributed setting, the problem models situations where agents search a common space 
and need to rearrange or organize the retrieved data. 

Our results indicate that these kinds of problems can be solved quite efficiently on a 
ring, and that the loss incurred by the lack of centralized control is not significant. We 
have focused our attention to distributed solutions tailored for a ring of agents. A natural 
extension would be to consider different topologies and analyze how our techniques and 
ideas have to be modified in order to give efficient algorithms in more general settings. We 
believe that the main ideas contained in this work could be useful to extend the results even 
to arbitrary topologies. Indeed, an 0{n polylog(n)) distributed leader election protocol 
(that is needed in our algorithm) is also available for arbitrary ad hoc radio networks [1]. 

For what concerns the ring topology, it is very interesting to note that the value p never 
appears in the message complexity for the synchronous case (not even if the polynomial 
relation between m and n does not hold), while a factor logp appears in the asynchronous 
case. It is still an open question if it is possible to devise an asynchronous algorithm that 
shows optimal message complexity, under the same hypothesis of the synchronous one; 
i.e., if it is possible to eliminate the extra logp/logn factor. 

Acknowledgments. The authors wish to thank Bruno Codenotti for many helpful 
comments and discussions. 
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